| Model | Dimensions | Arabic (ar) | Bengali (bn) | English (en) | Spanish (es) | Persian (fa) | Finnish (fi) | French (fr) | Hindi (hi) | Indonesian (id) | Japanese (ja) | Korean (ko) | Russian (ru) | Swahili (sw) | Telugu (te) | Thai (th) | Chinese (zh) | German (de) | Yoruba (yo) | Avg (18 datasets) | Avg (excl. de and yo) |
| -------- | -------- | -------- | -------- | -------- | -------- | -------- | -------- | -------- | -------- | -------- | -------- | -------- | -------- | -------- | -------- | -------- | -------- | -------- | -------- | -------- | -------- |
| embed-multilingual-v3.0 | 1024 | 76.5 | 75.8 | 57.0 | 55.1 | 57.5 | 77.1 | 57.4 | 61.7 | 52.5 | 69.6 | 66.0 | 68.8 | 75.7 | 83.3 | 79.5 | 58.9 | 58.7 | 61.8 | 66.3 | 67.0 |
| BM25 | N/A | 48.1 | 50.8 | 35.1 | 31.9 | 33.3 | 55.1 | 18.3 | 45.8 | 44.9 | 36.9 | 41.9 | 33.4 | 38.3 | 49.4 | 48.8 | 18.0 | N/A | N/A | N/A | 39.4 |
| OpenAI text-embedding-3-large | 3072 | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | 54.9 | N/A |

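Note: OpenAI has reported only the average of its MIRACL scores, so its per-language columns are marked N/A. BM25 has no scores for German (de) and Yoruba (yo), so it has no 18-dataset average. Among the models compared here, Cohere's embed-multilingual-v3.0 is state of the art (SOTA) in multilingual retrieval.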
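
The two average columns can be reproduced from the per-language scores. The snippet below is a minimal sketch, assuming the averages are unweighted means rounded to one decimal place (the table does not state this explicitly); the variable and function names are illustrative only.

```python
from statistics import mean

# Per-language scores copied from the table above.
cohere = {
    "ar": 76.5, "bn": 75.8, "en": 57.0, "es": 55.1, "fa": 57.5, "fi": 77.1,
    "fr": 57.4, "hi": 61.7, "id": 52.5, "ja": 69.6, "ko": 66.0, "ru": 68.8,
    "sw": 75.7, "te": 83.3, "th": 79.5, "zh": 58.9, "de": 58.7, "yo": 61.8,
}
# BM25 has no de/yo scores, so only the 16-language average is defined.
bm25 = {
    "ar": 48.1, "bn": 50.8, "en": 35.1, "es": 31.9, "fa": 33.3, "fi": 55.1,
    "fr": 18.3, "hi": 45.8, "id": 44.9, "ja": 36.9, "ko": 41.9, "ru": 33.4,
    "sw": 38.3, "te": 49.4, "th": 48.8, "zh": 18.0,
}

def average(scores, exclude=()):
    """Unweighted mean over languages, rounded to one decimal (assumed convention)."""
    return round(mean(v for lang, v in scores.items() if lang not in exclude), 1)

cohere_avg_18 = average(cohere)                        # all 18 datasets
cohere_avg_16 = average(cohere, exclude={"de", "yo"})  # excluding de and yo
bm25_avg_16 = average(bm25)                            # 16 datasets only

print(cohere_avg_18, cohere_avg_16, bm25_avg_16)
```

Under that assumption the computed values match the table: 66.3 and 67.0 for embed-multilingual-v3.0, and 39.4 for BM25.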